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(57) Abstract 

Methods of designing or modifying pro- 
tein structure at the protein or genetic level to 
produce specified amino termini in vivo or in vit- 
ro. The methods can be used to alter the meta- 
bolic stability and other properties of the protein 
or, alternatively, to artificially generate authentic 
ammo-termini in proteins produced through ar- 
tificial means. The methods are based upon the 
introduction of the use of artificial ubiquitin- 
protein fusions, and the discovery that the in vivo 
half-life of a protein is a function of the amino- 
terminal amino acid of the protein. 
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METHODS OF REGULATING METABOLIC 
STABILITY OF PROTEINS 

Backcyround of the Invention" 

In both bacterial and eukaryotic cells, rela- 
tively long-lived proteins, whose half-lives are 
close to or exceed the cell generation time, coexist 
with proteins whose half-lives can be less than one 
percent of the cell generation time. Rates of 
intracellular protein degradation are a function of 
the cell's physiological state, and appear to be 
controlled differentially for individual proteins. 
In particular, damaged and otherwise abnormal 
proteins are metabolically unstable in vivo. 
Although the specific functions of selective protein 
degradation are in most cases still unknown; it is 
clear that many regulatory proteins are extremely 
short-lived in vivo. Metabolic instability of such 
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proteins allows for rapid adjustment of their 
intracellular concentrations through regulated 
changes in rates of their synthesis or degradation. 
The few instances in which the metabolic instability 
of an intracellular protein has been shown to be 
essential for its function include the ell protein 
of bacteriophage lambda and the HO endonuclease of 
the yeast Saccharomyces cerevisiae * 

Most of the selective turnover of intracellular 
proteins under normal metabolic conditions is 
ATP-dependent and (in eukaryotes) nonlysosomal. 
Recent biochemical and genetic evidence indicates 
that, in eukaryotes, covalent conjugation of ubi- 
quitin to short-lived intracellular proteins is 
essential for their selective degradation. The 
rules which determine whether a given protein is 
met cyclically stable dr unstable in vivo were 
previously unknown. 

Summary of the Invention 

This invention pertains to methods of 
engineering the amino terminus of proteins thereby 
controlling the metabolic stability and other 
properties of a protein. Further, this invention 
provides a method for either in vivo or in vitro 
production of proteins with any of the twenty amino 
acid residues (or analogs thereof) at the protein's 
amino-terminus . The invention is based in part upon 
the striking discovery that the in vivo half -life of 
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an intracellular protein is a function of its 
amino-terminal amino acid residue and upon a novel 
(and more generally applicable) technique that 
allows one to generate proteins with specified 
amino-termini in vivo or in vitro.- The invention 
also pertains to a newly identified protease, 

* 

ubiquitin-specif ic processing protease, which has 
properties. that allow one to expose, either in vitro 
or in vivo , any desired amino acid residue at the 
amino-terminus of a protein of interest • 

The nature of the amino acid exposed at the 
amino-terminus of an intracellular protein was shown 

■ 

to be one crucial determinant that specifies whether 
a protein will be long- or short-lived in vivo * 
Individual amino acids can be categorized as either 
stabilizing or destabilizing amino acids with 
respect to the half-life that they confer upon a 
protein when exposed at the protein's amino- 
terminus. Destabilizing amino acid residues confer 
short half -lives, down to a few minutes for some of 
the destabilizing amino acids, stabilizing amino 
acid residues confer long half-lives of many hours. 
This striking and newly discovered' dependency of a 
protein 1 s half -life on its amino-terminal residue is 
referred to herein as the N-end rule. 

Based upon the N-end rule, the amino-terminus 
of a protein can thus be designed or altered to 
change the intracellular half-life of the protein 
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and in this way the lifetime and/or activity of the 
protein in vivo can be regulated. This capability 
can be exploited for rational protein design in many 
different contexts. Natural or wild type proteins 
can be modified to render them more or less 
resistant to degradation in vivo . The design or 
alteration of the protein can be done at the protein 
level or at the genetic (DNA) level. For example, 
proteins can be modified by chemically altering or 
engineering the amino- terminus to provide for 
exposure at the amino-terminus of an amino acid 
residue of the stabilizing or destabilizing class. 
At the genetic level, .genes encoding proteins can be 
made to encode an amino acid of the desired class at 
the amino-terminus so that the expressed protein . 
exhibits a predetermined amino-terminal structure 
which renders it either metabolically stable or 
unstable with respect to the N-end rule pathway of 
proteolytic degradation. Furthermore; proteins can 
be expressed fused to a "masking" protein sequence 
which masks the engineered amino-terminus so that 
when unmasked the protein will exhibit the desired 
stability or other properties that depend on the 
nature of the protein's amino-terminal residue, in 
such constructs, for example, the junction between 
the two protein sequences can be designed to be 
cleaved specifically, for instance, by an endopro- 
tease. Endoproteolytic cleavage of the fused 
sequence unmasks the 
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specifically engineered amino-terminus of the' 
protein of interest and subjects the protein to 

* 

degradation governed by the N-end rule. One spe- 
cific and new way to engineer the protein's amino- 
terminus is provided in this invention by the 
identification of ubiquitin-specif ic processing 
protease and determination of its substrate speci- 
ficity. Using this protease , fusions of ubiquitin 
with other proteins can be specifically processed 
either in vitro or in vivo to generate proteins with 
desired amino-terminal residues. 

A different, and also new way to specifically 
Engineer short-lived proteins is provided in this 
invention by the discovery that ubiquitin-protein 

* 

fusions, such as ubiquitin-Pro-^3 -galactosidase, 
that cannot be efficiently deubiquitinated, are 
metabolically unstable. Thus, by attaching the 
amino-tenainal ubiquitin moiety to a protein in a 
way that makes its removal either impossible or 
inefficient, one can destabilize proteins by a 
distinct technique that is not directly based on the 
N-end rule. 

In addition, variant cells can be developed 
which contain putative mutations in the "N-end fl 
degrading protease (s) which either conditionally or 
nonconditionally stop degrading short-lived pro- 
teins. These cells can be used to overproduce 
proteins that ordinarily would be short-lived within 
the cell. 
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Brief Description of the Figures 

Figure 1 shows the construction of ubiquitin- 
lacZ gene fusions . 

Figure 2 shows experiments in which the half- 
lives of engineered ^ -gal proteins are directly 
measured. 

Figure 3 shows the changing of amino acid 
residues at the ubiquitin- ^gal junction (A) using 
the newly discovered properties of ub iqui tin- 
specific processing protease and the amino acid 
sequence in the vicinity of the junction (B) . 

Figure 4 shows the presence of multiple ubi- 
quitin moieties in metabolically unstable ^-gal 
proteins* 

Figure 5. shows a series of y3-gal species 
containing ubiquitin in metabolically unstable 
p-gal proteins. 

Figure 6 shows that both prokaryotic and 
eukaryotic long-lived intracellular proteins have 
stabilizing amino acid residues at their amino 
termini whereas secreted proteins exhibit a com- 
plementary bias. 

Detailed Description of the Invention 

The elucidation of the IT-end rule is described 
in detail below. Briefly, this rule governing 
protein degradation was revealed by examining the in 
vivo half-lives of the enzyme y?-galactosidase having 
various amino acid residues at its amino -terminus 
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m 

and produced as a fusion protein with ubiguitin. 
When a chimeric gene encoding a ubiquitin-^ -galac- 
tosidase fusion protein is expressed in the yeast S. 

* 

cerevisiae , ubiquitin is cleaved off the nascent 
fusion protein, yielding a deubiquitinated-^-galac- 
tosidase ( ^gal) . With one exception, this clea- 
vage takes place regardless of the nature of the 
amino acid residue of ^gal at the ubiquitin-^ gal 
junction, thereby making it possible to expose 
selectively different residues at the amino-termini 
of otherwise identical ^gal proteins. The fS gal 
proteins so designed exhibited strikingly different 
half-lives in vivo , ranging from more than 20 hours 
to less than 3 minutes, depending upon the nature of 
the amino acid at the amino-terminus of ^sgal. 
Amino acids can be thus ordered according to the the 
half lives they confer on ^gal when present at its 
amino- terminus . For example, the amino acids 
methionine, serine, alanine, threonine, valine, 
glycine and cysteine confer a half-life of more than 
20 hours. Phenylalanine, leucine, asparagine, and 
lysine yield half lives of about three minutes. 
Arginine, the most destabilizing amino acid, confers 
a half life of about two minutes. (See Table 1 below 
for complete list of amino acids and the correspond- 
ing half -lives) - 

Currently known amino-terminal residues in 
long-lived, noncompartmentalized intracellular 
proteins from both prokaryotes and eukaryotes belong 



WO 88/02406 



PCT/US87/02522 



-8- 

virtually exclusively to the stabilizing class of 
amino acids, exactly as predicted by the N-end rule. 
This result stongly implicates the N-end rule in the 
selective degradation of intracellular proteins in 
general. 

The appropriate amino-terminal amino acid 
appears to be an essential (though not necessarily a 
sufficient) requirement for the metabolic stability 
of a noncompartnaentalized, intracellular protein. 
Thus, in order for a protein to be relatively stable 
intracellular^ a stabilizing amino acid should be 
present at the amino -terminus . The presence of a 
destabilizing residue at the amino-terminus of a 
protein is often, though not always, sufficient for 
its metabolic destabilization in vivo . When such 
destabilization occurs to a relatively small extent, 
further analysis shows either an insufficient 
accessibility of the amino-terminus or a lack of 
"permissible" sequence environment in the vicinity 
of the amino-terminus such as a lack of segmental 
mobility in the protein's amino-terminal region. 
The presence of a stabilizing amino acid at the 
amino-terminus at least in some cases (for instance, 
as observed for ^-gal) will confer stability upon 
the protein. However, a stabilizing amino acid at 
the amino-terminus may not always confer a long 
half-life because other degradative pathways may be 
involved in determining the ultimate fate of pro- 
tein. For example, endoproteolytic cleavages 
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(cleavages outside of terminal ' regions of the 
. protein) may result in exposure of a destabilizing 
amino acid at the amino-terminus of a resulting 
product of the cleavage which is then rapidly 
degraded via the N-end rule pathway. Appropriate 
circumstances for use of a stabilizing amino acid 
can be ascertained empirically. 

Although the N-end rule may be only one com- 
ponent (albeit a central one) of a more complex 
"half-life rule" which embraces other aspects of 
selective protein degradation in vivo , the N-end 
rule provides a rational, practicable approach for 
designing or changing protein structure in order to 
produce proteins which are more or less resistant to 
degradation by the N-end rule pathway than natural, 
unmodified protein. Proteins can be' designed or 
modified at the protein or gene level to provide a 
desired amino acid of either the stabilizing or 
destabilizing class at their amino-terminus. The 
ability to regulate the half-life of a protein will 
allow one to modulate the intracellular activity of 
the protein. 

A straightforward approach to modifying a 
protein to increase or decrease its metabolic 
stability is to directly engineer the amino-terminus 
of the protein at the protein level. To provide a 
desired amino-terminal amino acid, the amino- 
terminus of the protein of interest can be chemical- 
ly altered, for example, by adding an amino acid of 
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the stabilizing or destabilizing class to the 
amino-terminus of a protein or polypeptide, em- 
ploying an appropriate chemistry. Thus, for ex- 
ample, an unstable protein can be rendered more 
stable by adding a stabilizing amino acid residue 
(e.g. methionine serine, alanine, threonine, valine 
glycine or cysteine) to the amino-terminus of the 
protein. Conversely, a stable protein can be 
destabilized by adding a destabilizing amino acid to 
the amino-terminus. one distinct way to modify the 
amino-terminus of a protein would be to employ spe- 
cific enzymes, amino acid-protein ligases, which 
catalyze posttranslational addition of a single 
amino acid to the protein amino-terminus. Other 
methods for nongenetic alterations of the same type 
can readily be ascertained by those skilled in the 



In some proteins, the amino- terminal end is 
obscured as a result of the protein's conformation 
(i.e., its tertiary or quaternary structure). In 
these cases, more, extensive alteration of the 
amino-terminus may be necessary to make the protein 
subject to the N-end rule pathway. For example, 
where simple addition or replacement of the single 
amino-terminai residue is insufficient because of an 
inaccessible amino-terminus, several amino acids 
(including lysine, the site of ubiquitin joining to 
substrate proteins) may be added to the initial 
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amino-terminus to increase the accessibility and/or 
segmental mobility of the engineered amino-terminus . 

.Modification or design of the amino-terminus of 
a protein can also be accomplished at the genetic 
level. Conventional techniques of site directed 
mutagenesis for addition or substitution of ap- 
propriate codons to the 5 1 end of an isolated or 
synthesized gene can be employed to provide a 
desired amino-terminal structure for the encoded 
protein. For example, so that the protein expressed 
has the desired amino acid at its amino-terminus the 
appropriate codon for a stabilizing amino acid can 
be inserted or built into the amino-terminus of the 

* 

protein-encoding sequence. 

At the same time, expressed proteins are often 
naturally modified within a cell after translation. 
This can include modification of the amino-terminus. 
For example, the amino-terminus can be acted on by 
an aminopeptidase which cleaves one or several amino 
acids from the amino-terminus, Amino acids may also 
be added to the amino-terminus by post-translational 
processing. This invention provides a way to 
"by-pass" still undefined rules of amino-terminal 
protein processing to expose exactly and specifical- 
ly the desired amino acid residues at the amino- 
terminus of a mature processed protein species. To 
minimize the impact of such posttranslational events 
on the ultimate structure of the amino-terminus of a 
protein of interest, specific fusion proteins can be 
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designed wherein the amino-terminus of a protein of 
interest (designed to have the desired stabilizing 
or destabilizing structure) is preceded by a 
"masking" protein sequence fused to the amino- 
terminus. The fusion proteins are designed so that 
the protein sequence fused to the amino-terminus of 
the masking protein of interest is susceptible to 
specific cleavage at the junction between the two. 
Removal of the protein sequence thus unmasks the 
amino-terminus of the protein of interest and the 
half-life of the released protein is thus governed, 
by the predesigned amino-terminus. The fusion 
protein .can be designed for specific cleavage in 
vivo, for example, by a host cell endoprotease or 
for specific cleavage in a in vitro system where it 
can be cleaved after isolation from a producer cell 
(which lacks the capability to cleave the fusion 
protein) . 

Ubiquitin is a broadly useful fusion partner 
for construction of a fused protein with a protein 
of interest: the discovery that artifical ubiqui tin- 
protein fusions can be cleaved precisely by a 
cytoplasmic eukaryotic protease with little or no 

dependence on the protein to which- ubiquitin is 
fused can be applied both in vivo and in vitro in 
protein engineering strategies, and is a major 
aspect of this invention. For example, the ubiquitin- 
protein fusion method can be used to. artifically 
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generate authentic amino-termini in proteins pro- 
duced through artifical means. Thus, amino-termini 
characteristic of natural eukaryotic or prokaryotic 
proteins can be generated by in vitro cleavage of 
ubiquitin-protein fusions produced in a prokaryotic 
host. 

A specific methodology for producing ubiquitin- 
^-galactosidase fusion proteins is described in 
detail below. Genes encoding any other proteins can 
be substituted for LacZ (the p -gal gene) in this 
methodology. 

In general , ubiquitin fusion proteins are 
expressed by a chimeric gene construct comprising, 
in 5' to 3 1 orientation, a ubiquitin gene linked to 
a gene encoding the protein of interest. The codon 
for the amino-terminal amino acid of the protein of 
interest is located immediately adjacent the 3 1 end 
of the ubiquitin gene. The fused gene product is 
cleaved endoproteolytically either in vivo or in 
vitro (using either pure br partially purified 
ubiquitin-specific protease identified in the 
present invention) at the junction between ubiquitin 
and the protein of interest to generate the protein 
of interest having the desired amino acid at its 
amino-terminus. There are a number of specific uses 
for the described ability to specifically engineer 
the protein's amino-terminus. One such use is 
established by the fact that the intracellular 
half-life of the released protein is governed by the 
principles of the N-end rule. Other applications of 



WO 88/02406 PCT/US87/02522 



-14- 

the specific method for engineering the protein 
amino-terminus described herein range from adjusting 
the desired functional properties of a protein of 
interest , to modulating its antigenicity, and again, 
to other uses that can readily be ascertained by 
those skilled in the art. 

This method of generating the desired amino 
acid residue at the amino-terminus of a protein of 
interest involves two novel components: one, the 
use of ubiquitin-protein fusions, and the other, the 
use of ubiquitin-specif ic processing protease that 
has been identified, and whose striking substrate 
requirements were discovered, in this work. Al- 
though the initial identification of the ubiquitin- 
specif ic protease has been made in vivo , the enzyme 
is also relatively stable and active in vitro (in 
extracts), and can readily be purified to homogeneity 
by techniques known to those skilled in the art. 
Furthermore, the substrate specificity of the " 
ubiquitin-specific processing protease is highly 
conserved in evolution, being the same in yeast and 
mammals. The enzyme can be purified chromato- 
graphically from a crude extract by sequential 
chromatography on phosphocellulose, DEAE cellulose, 
and SH Sepharose among other methods known to those 
skilled in the art. Alternatively , the gene- for 
this protease can be cloned by those skilled in the art 
The cloned protease gene can be used either in vivo. 
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or, alternatively, the gene can be overexpressed in 
a suitable host, the overexpressed ubiquit in- 
specific protease purified and used for the same or 
similar purposes in vitro • The discovery of this 
enzyme, and detailed characterization of its sub- 
strate specificity herein provides for the in vitro 
and in vivo use of this enzyme. 

The use of ubiquitin-protein fusions to allow 
the generation of a desired amino acid residue at 
the amino-terminus of a protein of interest can be 
extended to facilitate the purification of such 
proteins from producer cells. A gene can be readily 
constructed that encodes a convenient marker pro- 
tein, such as streptavidin, linked to a ubiquitin- 
protein fusion construct described above. The 
resulting (marker protein) -ubiquitin-protein fusion 
can be simply isolated from producer cells by using 
the preselected property of the marker protein, for 
instance, the known ability of streptavidin to be 
isolatable by affinity chromatography on a biotin 
column. Thus, purified (marker protein) -ubiquitin- 
protein fusion can then be specifically cleaved by 
the ubiquitin-specif ic protease described in this 
invention to generate the final product, a protein 
of interest with the desired amino acid residue at 
its amino-terminus . 

The codon for the amino-terminal amino acid of 
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desired amino acid by, for example, site-directed 
mutagenesis techniques currently standard in the 
field. If the gene encoding the protein of interest 
is a synthetic gene the appropriate 5 r codon can be 
built-in during the synthetic process. Alter- 
natively , nucleotides for a specific codon can be 
added to the 5 1 end of an isolated or synthesized 
gene by ligation of an appropriate DNA sequence to 
the 5 1 ( amino -terminus encoding) end of the gene. 

Ubiquit in-like fusion partners capable of being 
cleaved by the ubiquitin-specific protease can also 
be used. In addition, fusion partners other then 
ubiquitin for masking the amino-terminus of a 
protein of interest can be used. In appropriate 
cases, the fusion proteins can be designed to 
contain a proteolytic cleavage site for a restric- 
tion endoprotease which has suf ficiently narrow 
specificity so that only one target site is cleaved 
in a fusion protein. A crucial property of such a 
protease must be a sufficiently relaxed requirement 
for the nature of the amino acid residue (s) abutting 
the carboxy-terminal side of the cleavage site. The 
target site for cleavage is the junction between the 
fusion partner and the amino-terminus of the protein 
of interest and thus the recognition site for the 
endoprotease is located to provide for cleavage at 
this location. The commerically available protease, 
complement factor X a , exhibits these properties and 
- thus can be used to directly generate proteins with 
predetermined amino acid residues in the ultimate 
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* 

position of their amino-termini (see, K. Nogai and 
H.C. Thogersen Nature 309:810 (1984)). The recog- 
nition site for the endoprotease can be engineered 
into the junction between the masking protein 
sequence and the 3' region encoding the amino- 
terminus of the protein of interest. 

A different and distinct method for engineering 
short-lived proteins is provided in this invention 
by the discovery that ubiquitin-protein fusions, 
such as ubiquitin-Pro- ^-galactosidase fusion (Table 
1) , that cannot be efficiently deubiquitinated are 
metabolically unstable. Thus r by attaching the 
amino-terminal ubiquitin moiety to a protein in a 
way that makes its removal either impossible or 
inefficient, one can destabilize a protein by a 
distinct technique which is qualitatively different 
from the method of generating the desired 
amino-terminus of a protein according to the 
requirements of the N-end rule. Prevention of the 
efficient deubiquitination of a ubiquitin-protein 
fusion can be achieved in several ways, for in- 
stance, by using a proline residue at the 
ubiquitin-protein junction as shown in Table 1, or 
by changing the amino acid sequence of ubiquitin 
near its carboxyl-terminus in such as way that the 
ubiquitin moiety is no longer recognized by the 
ubiquitin-specific processing protease but can still 
be recognized by the rest of the degradative path- 
way. These and other ways to reduce the rate of 
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deubiquitination of a ubiquitin-protein fusion can 
be readily ascertained by those skilled in the art. 

The methods of this invention can be employed, 
inter alia, for regulating the half-life of a 
protein intracellular^. There are many instances 
where this capability is useful; For example, when 
a gene is introduced into a cell for expression 
therein, the expressed product can be designed for a 
long or short half-life depending upon the particu- 
lar need. 

In general, destabilized proteins which have 
short half-lives are more amenable to regulation of 
intracellular levels of the protein. The ability to 
finely regulate the intracellular levels and ac- 
tivity of a protein can be useful in therapy or in 
. the work with in vitro cell cultures. In gene 
therapy, for example, a gene may be introduced into 
a cell to compensate for a genetic deficiency or 
abnormality. The gene can be inserted under control 
of an inducible promoter. Induction results in 
enhanced expression of the gene product and con- 
sequently, higher levels of. the product within the 
cell, if the gene is designed to encode an unstable 
protein, the intracellular concentration of the 
expressed protein will be more quickly responsive to 
a later reduction in the rate of its synthesis 
because it does not persist within the cell, in 
this way, the intracellular level and/or activity of 
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the protein encoded by the inserted gene can be more 
finely regulated. 

j The method of this invention can also be used 

to expand the uses of selectable markers by- 
shortening the time necessary for a phenotype 
related to the marker to become manifest • Toward 
this end, a product encoded by a marker gene can be 
destabilized by altering its amino-terminus ac- 
cording to the N-end rule. In this way selection 
1 for the negative phenotype can be facilitated 

because the product of the marker gene will be more 
quickly extinguished after function of the gene 
encoding the marker is abolished. An example is the 
thymidine kinase (tk) gene The tk gene can be 
engineered to encode a less stable enzyme by intro- 
ducing an appropriate destabilizing amino acid at 
the amino-terminus . Gene mutation resulting in tk" 
phenotype will be more quickly manifested by cells 
because residual tk'will be more quickly degraded. 
This can be especially useful in slow growing cells 
where more time is required to "dilute out" tk 
j synthesized prior to transformation to the tk" type. 

The* principles of protein modification based 
upon the N-end rule may also be employed in the 
design of cytotoxins. Proteinaceous cytotoxins can 
be designed as unstable proteins degradable by the 
N-end rule pathway so that they do not persist after 
their toxic action has been exerted on a target 
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cell. Reducing the lifetime of the toxin reduces 
the likelihood of killing riontargeted cells. 

Discovery of the N-end rule pathway of degra- 
dation allows development of mutant cells having 
mutations in genes encoding essential components of 
the N-end rule pathway. For example, cells can be 
produced that either permanently or conditionally 
are unable to efficiently degrade otherwise short- 
lived proteins. These cells can be used to produce 
desired proteins that ordinarily would be unstable 
within a cell. 

The invention is illustrated further by the 
following detailed description of the elucidation of 
the N-end rule. 

Methods 

Protein Sequencing 

S. cerevisiae cells carrying pUB23 (Fig. 1), 
which encodes ub-Met-^gal (Fig. 3A) ,•' were labeled 
with [ S] methionine, followed by extract pre- 
paration, immunoprecipitation of £gal and elect- 
rophoresis as described below. The wet polyacryla- 
mide gel was subjected to autoradiography, the band 
of ^gal was excised, and the electroeluted ^gal was 
subjected to six cycles of radiochemical sequencing 
by Edman degradation. The sequencing was carried 
out by w. Lane at the MicroChem Facility of Harvard 
University.. 
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Site-directed Mutagenesis 

pUB23 (figure 1) was treated sequentially with 
Acc I, the Klenow fragment of pol I, and Bam HI. A 
fragment containing the Xho I site was purified and 
inserted between a filled-in Hind III site and a BAH 
HI site of the M13mp9 phage DNA. (J. Messing and J. 
Vieira, Gene 19, 263 (1982)), Site-directed muta- 
genesis (M. Smith, Annu. Rev . Genet . 19, 423 (1985)) 
was carried out as described by Kramer, W. et al. 
Nucl. Acids Res . 12, 9441 (1984) using a synthetic 
25-residue oligodeoxyribonucleotide, containing ten 
bases on the 5 1 side and twelve bases on the 3 V side 
of the Met codon of ^gal. All four bases were 
allowed to. occur at the original Met codon positions 
during synthesis. Primary phage plaques were 
screened by hybridization (Wood, N.I. et al. PNAS 
82, 1585 (1985)), with the use of a 12-residue 
oligonucleotide probe spanning the region of codon 
changes and hybridizing to the original sequence. 
Nonhybridizing plaques containing inserts of the 
expected size were sequenced by the chain termina- 
tion method. (Sanger, F. et al. , PNAS 71 5463 
(1977)). To transfer the desired constructs into 
the pUB23 background, replicative form DNA of mutant 
phages was digested with Xho I and Bam HI, and added 
to the same digest of the plasmid pLGSD5-ATG (see 
Fig. 1 and L. Guarente, Methods Enzymol . , 101 181 
(1983)). The ligated mixture was used to transform 
the E. coli strain MC1061. (M.J. Casadaban and S.N. 
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Cohen, J* Mol . Biol. , 138 179 (1980)). Colonies 
containing plasmids of interest (in which the open 
reading frame of ^gal had been restored) were 
recognized by their light blue color on X-gal 
plates . 

Pulse-Chase Experiments 

S. cerevisiae cells of the strain BWG-9a-l (MAT 
his4 ura3 ade6) , transformed (F. Sherman et al. 
Methods in Yeast Genetics Cold Spring Harbor Labo- 
ratory, N.Y. r 1981)) with plasmids of interest were 
grown at 30°c to A 6Q0 of approximately 5 in a medium 
of 2 percent galactose, 0.67 percent Yeast Nitrogen 
Base without amino acids (DIFCO) , adenine (10 jug/ml) 
and amino acids including methionine ( Sherman , F. et 
al • , supra) . Typically, cells from a 5 ml culture 
were harvested by filtration through the well of a 
Millipore microtiter filtration plate , washed 
several times on the filter with "the same medium 
lacking methionine and resuspended in 0 .3 ml of 1 
percent galactose, 50 mM potassium phosphate buffer 
(pH 7.4). C 3 5 S J methionine (50 to 100 /aCL) was then 
added for 5 minutes at 30°C; the cells were col- 
lected by filtration and resuspended on 0.4 ml of 
the growth medium containing cycloheximide at 0.5 
mg/ml. Samples (0.1 ml) were withdrawn at indicated 
times , and added to 0.75 ml of cold buffer A (see 
below for buffer composition) containing leupeptin, 
pepstatin A, antipain, aprotinin and chymostatin 
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(Sigma) , (each at 20jug/ml) in addition to 0.4 ml of 
glass beads. Immediately thereafter, the cells were 
disrupted by vortexing for approximately 3 minutes 
at 4°C; the extracts were centrifuged at 12,000g for 
3 minutes and the radioactivity of acid-insoluble 

35 

S in the supematants was determined* Aliquots of 
the supematants containing equal amounts of the 

35 

total acid-insoluble S were processed for immuno- 
precipitation with a monoclonal antibody to gal. 
Ascitic fluid containing a molar excess of the 
antibody (at least tenfold) was added to each 
aliquot, with subsequent incubation at 4°C for 2 
hours; protein A-Sepharose (Pharmacia) was then 
added, the suspension was incubated with rocking at 
4°C for 30 minutes and centrifuged at 12,000g for 2 
minute. The protein A-Sepharose pellets were washed 
three times in buffer A (see below) containing 0.1 
percent sodium dodecyl sulfate (SDS) , resuspended in 
an SDS, dithiotreitol (DTT) -containing electro- 
phoretic sample buffer (U.K. Laemmli, Nature 227 680 
(1970)), heated at 100°C for 3 minutes, and cen- 
trifuged at 12,000g for 1 minute. Equal aliquots of 
the supematants were subjected to electrophoresis 
in a 7 percent discontinuous polyacrylamide-SDS gel 
(15 by 15 by 0.15 cm) with subsequent flourography. 
In some experiments, the above protocol was not 
used, but the extracts were prepared by boiling 
cells directly in the presence of SDS, with es- 
sentially the same results. 
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Analysis of ub- ^gal proteins produced in E, Coli 

Plasmid pUB23 (Figs. 1 and 3) was introduced 
into DS410, a minicell-producing E. coli strain. 
(N. Stoker , efc al., in Transcr iption and Trans- 
lation; A practical Approach B. Q. Harnes and S. or. 
Higgins, Eds., IRL press, Oxford, 1984, p. 153). 
Minicells were prepared and labeled for 60 minutes 
at 36°C with C 3 5 S] methionine ( 600 Ci/mmole, 
Amersham) as described by N. Stoker et al., supra . 

Labeled minicells were centrifuged, resuspended 
in 2 percent SDS, 10 mtf DTT, 10 mM Na-HEPES (ph 7.5} 
and heated at 100°C for 3 minutes. After centri- 
fugation at 12,000g for 1 minute the supernatant was 
diluted 20-fold with buffer A (1 percent Triton 
X-100, 0.15 M.NstCl, 5 mM Na-EDTA, 50 xoM Na-HEPES, ph 
7.5) , followed by the addition of phenylmethyl- 
sulfonyl fluoride (PMSP) and N-ethylmaleimide to 0.5 
mM and lOmM, respectively. After 4 hours at 4°C, 
the sample was dialyzed against buffer A containing 
0.5 mM PMSP overnight at 4°C, and processed for 
immunoprecipitation (as described above) . 

Analysis of ub- ^ ?<yal proteins produced in yeast 

S. cerevisiae cells carrying plasmids of interest 
were grown in 800 ml of a uracil-deficient medium, 
then harvested and disrupted with glass beads in 
buffer A containing leupeptin, pepstatin A, anti- 
pain, aprotinin and chymostatin (each at 3 /tg/ml) . 
The extract was centrifuged at 12,000g for 3 
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minutes. Saturated ammonium sulfate was added to 
the supernatant to a final concentration of 57 
percent. After overnight incubation at 4°C, the 
precipitated protein was collected by centrifugation 
at 23,000g for 30 minutes. The pellet was redis- 
solved in buffer A containing protease inhibitors. 
After clarification at 12,000g for 3 minutes, the 
sample was passed through an affinity column which 
had been prepared by crosslinking an IgG fraction 
from an ascitic fluid (containing a monoclonal 
antibody to gal to Affi-Gel 10 (Bio-Rad) . The IgG 
fraction used for crosslinking had been purified 
from the ascitic fluid by affinity chromatography on 
protein A-Sepharose. After washing with buffer A 
lacking Triton X-100, the antibody-bound proteins 
were eluted with 0.25 M glycine-HCl (pH 2.6). The 
eluate was immediately adjusted to pH 7.5 with 1 M 
Na-HEPES (pH 8.5), and thereafter made 0.1 percent 
in SDS. The sample was concentrated by ultrafil- 
tration in Centricon 30 (Amicon) , and subjected to 
electrophoresis in a 7 percent discontinuous poly- 
acrylamide-SDS gel (U.K. Laemmli, Nature (London) 
227, 680 (1970),). Electroblotting of proteins to 
nitrocellulose, and immunoblot analysis with a 
peptide-mediated antibody to ubiquitin were per- 
formed as described by P.S. Swerdlow, D. Finley and 
A. Varshavsky, Analyt . Biochem . 156, 147 (1986). 
The same results were obtained with a different 
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ahtibody to ubiquitin obtained from A. Haas (Univ of 
Milwaukee Med". School) . 

Detailed Description of the Figures 

Figure 1 shows construction of a ubiquitin-lacZ 
gene fusion. pUB2, a pBR322-based genomic DNA clone 
(E. Ozkaynak, et al. Nature 312 , 663 (1984) contains 
six repeats of the yeast ubiquitin-coding sequence 
(open boxes) together with the flanking regions 
(jagged lines) . pUB2 was modified as shown in the 
diagram by placing a Bam HI site six bases down- 
stream from the first ubiquitin repeat. This 
allowed the construction of an in-frame fusion 
(confirmed by nucleotide sequencing) between a 
single ubiquitin repeat and the lacZ gene of the 
expression vector pLGSD5-ATG (called G2 in L. 
Guarente, Methods Enzymol . 101 181 (1983)). The 
term "2 /tm" denotes a region of the pLGSD-ATG that 
contains the replication origin and flanking se- 
quences of the yeast plasmid called 2 /*m circle (See 
L. Guarente, supra) . Figure 3B shows the amino acid 
sequence of the fusion protein in the vicinity of 
the ubiquitin-^gal junction. 

Figure 2 shows that the in vivo half-life of 
gal is a function of its amino-terminal residue, 
(lane a) Minicells isolated from an E. coli strain 
carrying pUB23, the initial ub -lacZ fusion (Figs, l 
and 3B), were labeled with [ 35 S] methionine for 60 
minutes at 36°C, with subsequent analysis of £gal as 
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described. The same result was obtained when the 

labeled minicell SDS extract was combined with an 

unlabeled yeast SDS extract before immunoprecipita- 

tion of gal. (lane b) S. cerevisiae cells carrying 

pUB23 (Fig. 1), which encodes ub-Met-^gal (Fig. 3B) f 

were labeled with [ 3 5 S] methionine for 5 minutes at 

30°C, with subsequent analysis of ^tgal. The same 

result was obtained with the lengths of the 
35 

[ S] methionine labeling periods from 1 to 3,0 
minutes, and with yeast extracts produced either by 
mechanical disruption of cells in the presence of 
protease inhibitors or by boiling the cells directly 
in an SDS-containing buffer. (lane c) Same as lane 
a but with E. coli cells carrying the control 
plasmid p'LGSDS (called Gl in L. Guarente, supra . ) 
which encodes gal. (lanes d to g) S. cerevisiae 
cells carrying pUB23 (Fig. 1) , which encodes ub-Met- 
^gal (Fig. 3 A) , were labeled with [ 3 5 S] methionine 
for 5 minutes at 30°C (lane d) followed by a chase 
in the presence of cycloheximide for 10, 30, and 60 
minutes (lanes e to g) , extraction, immunoprecipita- 
tion, and analysis of ^gal. (lanes h to j) Same as 
lanes d to f , but with ub-Ile-^gal (see Fig. 3 A) . 
(lanes k to m) Same as lanes h to j, but with 
ub-Gln-^Jgal. (lanes n to q) Same as lanes d to g, 
but with ub-Leu-^gal. (lanes r to u) Same as lanes 
d to g, but with. ub-Arg-f gal. Designations: ori; 
origin of the separating gel; ub, ubiquitin; ^gal, 
an electrophoretic band of the ^3gal protein 
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containing a specified amino-terminal residue; in 
this terminology , the Met-/3gal portion of ub-Met- 
pgal is designated as ^ gal. Arrowheads denote a 
metabolically stable, about 90kD degradation product 
of ^gal which is formed apparently as the result of 
an in vivo endoproteolytic cleavage of a proportion 
of short-lived y?gal proteins such as Leu-^?gal and 
Arg-^gal (lanes n to u) • 

Figure 3 shows the changing amino acid residues 
of gal at the ubiquitin- gal junction. (A) The 
initial plasmid, pUB23 (Fig.l) r which encodes 
ub-Met-^gal, was mutagenized as described above to 
convert the original Met codon ATG at the ub-^gal 
junction into codons specifying 19 amino acids other 
than Met.- (The original round of mutagenesis shown 
in Figure 3, produced 15 out of 19 possible sub- 
stitutions. The remaining four substitutions were 
produced later (see Table 1)). The arrowhead in- 
dicates the site of the deubiquitinating in vivo 
cleavage in the nascent fusion protein that occurs 
with all of the fusion proteins except ub-Pro- gal 
(see text) . All of the constructions shown encode 
His as the second gal residue. In addition,, in 
some of the constructions (ub-Met-His-Gly-^gal, 
ub-Met-Gln-Gly-£gal, and ub-Met-Gln-His-Gly-^gal, 
the. last one produced by an insertion mutation; see 
Table 2) , either His or Gin were following Met at 
the ubiquitin-^gal junction, with indistinguishable 
consequences for the metabolic stabilities of the 
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corresponding gal proteins* (B) The amino acid 
sequence (in single-letter abbreviations) of ub-Met- 
pqal, the initial fusion protein (Fig. 1), in the 
vicinity of the ub-^gal junction. Single-letter 
amino acid abbreviations: A, Ala; C, Cys; D, Asp; E f 
Glu; F, Phe; G, Gly; H, His; I, lie; K r Lys; L, Leu; 
M, Met; N, Asn; P, Pro; Q, Gin; R, Arg; S r Ser; T, 
Thr; V, Val; W, Trp; Y, Tyr. 

Figure 4 shows that ubiquitin- gal is short- 
lived if not deubiquitinated. (lanes a to g) S. 
cerevisiae cells carrying plasmids encoding 
ub-X-£gal fusion proteins in which X is the residue 
indicated at the top of each lane, were labeled for 
5 minutes at 30°C with [ 3 5 S] methionine, followed by 
extraction, immunoprecipitation and analysis of 
pgal. Fluorographic exposures for these lanes were 
several times longer than those for similar patterns 
in Figure 2 to reveal the multiple ubiquitination of 
short-lived ^gal proteins. (lanes h, i) Fluoro- 
graphic overexposure of lanes n, o in Fig. 2 to 
reveal the "ladder" of multiply ubiquitinated 
Leu- p gal proteins in a pulse-chase experiment (zero 
and 10 minutes chase, respectively), (lane j) Same 
as lanes a to g, but with ub-Pro-^gal. (lane k) 
Same as lane j, but with ub-Gln-/3gal . (lane 1) Same 
as lane j . (lanes m to p) S . cerevisiae cells 
carrying a plasmid encoding ub-Pro-^gal were labeled 
for 5 minutes at 30°C with [ 3 5 S] methionine (lane m) 
followed by a chase in the presence of cycloheximide 
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for 10, 30, and 60 minutes (lanes n to p) . The 
upper small arrow to the right of lane p denotes 
ub-Pro-^gal, a small proportion of which is still 
present after 1 hour chase. The lower small arrow 
indicates an apparently deubiquitinated Pro-^gal 
that slowly accumulates during chase and is meta- 
bolically stable. The dot to the left of lane m 
denotes an endogeneous yeast protein that is pre- 
cipitated in some experiments by the antibody used. 
Square brackets denote the multiply ubiquitinated 
^-gal species (see Fig. 5). Other designation are 
as in Figure 2. 

Figure 5 shows the "ladder" gal species 
containing ubiquitin. (lane a) s. cerevisiae cells 
carrying a plasmid which encodes ub-Gln- 5gal , were 
grown and disrupted, and the extracts processed for 
isolation of sgal proteins by affinity chromato- 
graphy on a column with immobilized antibody to 
^gal. The ^gal proteins thus obtained were 
electrophoresed in a polyacrylamide-SDS gel, trans- 
ferred to nitro-cellulose, and probed with an 
antibody to ubiquitin. (lane b) Same as lane a, but 
with ub-Pro^gal. (lane c) Same as b but a longer 
autoradiographic exposure, (lane d) S. cerevisiae 
cells carrying a plasmid which encodes ub-Leu-flgal 
were labeled with [ 3 5 S] methionine for 5 minutes, 
with subsequent extraction, immunoprecipitation and 
electrophoresis ' of ^pgal (the same sample as in 
Figure 4, lane f) . Square brackets denote the 
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multiply ubiquitinated Gln-0gal species detected 
with antibody to ubiquitin. The arrow indicates the 
band of ub-Pro-^gal, the initial fusion protein seen 
in lanes b and c. The arrowheads indicate the 
position of the band of deubiquitinated ^jgal (de- 
tectable by either Coomassie staining or metabolic 
labeling f but not with antibody to ubiquitin) 
derived from the ub-Gln-pfgal fusion protein. 

Figure 6 shows both prokaryotic and eukaryotic 
long-lived intracellular proteins have stabilizing 
amino acid residues at their amino-termini, whereas 
secreted proteins exhibit a complementary bias. 

(A) 208 long-lived, directly sequenced, intra- 
cellular (noncompartmentalized) proteins with 
unblocked amino-termini from both prokaryotes (77 
proteins) and eukaryotes (131 proteins) were distri- 
buted into three groups according to the nature of 
their amino-terminal residues as defined by the 
N-end rule (Table 1) . All of the long-lived intra- 
cellular proteins examined bear exclusively stabi- 
lizing residues at their amino-termini. In panels B 
to D, analogous diagrams are presented for 243 
secreted eukaryotic proteins (B) , for 37 light and 
heavy immunoglobulin chains (C) , and for 94 secreted 
eukaryotic toxins (D) . Entries in C and D are 
subsets of entries in B. For proteins in B to D r 
the amino-termini compiled correspond, whenever the 
assignment is possible, to the most processed form 
of a protein that is still located within a 
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secreting cell. The data In A to D were manually 
compiled from the entire set of complete protein 
sequences available before 1981. The same con- 
clusions have been recently reached after a more 
detailed "and extensive, computer-assisted tabulation 
of protein amino-termini using the current National 
Biomedical Research Foundation database. The 
amino-terminal residues of Asn, Cys, His, and Trp 
were excluded from the compilation because in vivo 
half -lives of the corresponding gal proteins are 
still unknown (see, however, the legend to Table 1) . 
Inclusion of the residues (Table 1) into a recently 
compilation of the same type did not change the 
original conclusion. Although the amino-terminal 
Pro was also excluded from the compilation, Pro 
appears to be a stabilizing residue for fiqa.1 (Table 
1) , consistent with the frequent presence of Pro at 
the amino-termini of long-lived noncompartmentalized 
proteins . 

Results and Discussion 

Rapid in vivo deubiquitination of a nascent ubi- 
quitin- p cral fusion protein 

Branched ubiquitin conjugates in which the 
carboxyl-terminal glycine of ubiquitin moieties is 
joined via an isopeptide bond to the *-amino groups 
of internal lysine residues in proteins apparently 
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comprise the bulk of ubiquitin conjugates in eu- 
karyotic cells. Joining of ubiquitin to the amino- 
terminal <* -amino groups of target proteins, to yield 
linear ubiquitin conjugates, may also be chemically 
feasible. See A. Hershko, et al . , PNAS USA 81 ; 7021 
(1984) . Whether or not linear ubiquitin-protein 
fusions are actually synthesized in vivo through 
posttranslational enzymatic conjugation of ubiquitin 
to protein amino- termini , such proteins can also be 
produced by constructing appropriate chimeric genes 

* 

and expressing them in vivo. Construction of one 
such gene, which encodes yeast ubiquitin linked to 
gal of Escherichia coli, is shown in Figure 1. 

When this gene is expressed in E. coli , the 
resulting ^gal-containing protein has an apparent 
molecular mass which is approximately 6 kD greater 
that that of the control /?gal, a value consistent 
with the presence of ubiquitin in the protein 
encoded by the chimeric gene. In contrast, when the 
same gene is expressed in yeast, the corresponding 
^gal protein is electrophoretically indistinguish- 
able from the control flgal. This result is in- 
dependent of the length of the [ S] methionine 
labeling period (between 1 and 30 minutes) . 
Furthermore, determination of the amino-terminal 
residue in the putative Met-^gal (half -life, t 1 / 2 20 
hours) by Edman degradation of the in vivo-labeled, 
gel -purified ^J-gal (Figure 2, lane d) directly 
confirmed the presence of the expected Met residue 
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(Figure 3A and Table i, at its amino-terminus. 
independent evidence that ubiguitin cleavage off the 
fusion protein occurs Mediately after the last Gly 

resxdue of ubiquitin is presented belov. We con- 
clude that in yeast, ubiguitin is efficiently 
cleaved off the nascent ubiguitin- fusion protein, 
yielding a deubiguitinated ,gal. CT he absence of 
the deubiguitinatiori reaction in E. coli is con- 
sistent with other lines of evidence indicating that 
prokaryotes lack both ubiguitin and- ubiguitin- 
specific enzymes] . 

The ubiguitin- /J-gai junction ancodsd by the 
chimeric gene, Gl y -M. t (Figure i and 3B) , is 
identical to the junctions between adjacent repeats 
- the polyubiguitin precursor protein, which is 

Z IT?? f Pr ° CMSe<I lnt ° ubiguitin. Thus 

it is X^ely that the same protease, as yet un- 

^aracterized biochemically, is responsible both for 
the conversion of polyubiouitin into mature ubi- 
^bi™T t r £W *** ^^^"on of the nascent 

to inhibit the in vivo deubiguitination of the ubt- 
guxtin-pgal (and thereby to allow analysis of . 

ZTTZ "r"'" ° f * Stabla attach- 
ment to, -gal, woula be to convert the Met residue 

Z£ZL T Action (Figure 3B, 

into other amino acid residues (Figure 31). The 

unexpected results of =„-h 

belov. approach are described 
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The in vivo half -life of p gal is a function of its 
amino- terminal residue . The ATG codon which speci- 
fies the original Met residue of gal at the ubi- 
quitin junction (Figure 3B) was converted by sited 
directed mutagenesis into codons specifying 19 other 
amino acids (See Figure 3A and Table 1) . These 
constructions differ exclusively in the first codon 
of p gal at the ubiquit in- junction (Figure 3A) . 
After each of the 16 plasmids thus designed was 
introduced into yeast, analysis of the corresponding 
gal proteins pulse-labeled in vivo led to the 
following results (Figures 2, 4, and Table l) : 

1) With one exception (see below) , the effi- 
cient deubiquitination of the nascent ubiquitin-^gal 
occurs irrespective of the nature of the amino acid 
residue of /Jgal at the ubiguitin-^gal junction. 
Thus, the apparently ubiquit in-specific protease 
that cleaves the original ubiquitin-^gal protein at 
the Gly-Met junction is generally insensitive to the 
nature of the first residue of y0gal at the junction 
(Figure 3A and Table 1) . This result, in effect, 
makes it possible to expose different amino acid 
residues at the amino-termini of the otherwise 
identical y9gal proteins produced in vivo. 

2) The in vivo half-lives of the^S gal proteins 
thus designed vary from more than 20 hours to less 
than 3 minutes, depending on the nature of the amino 
acid residue exposed at the amino-terminus of y3gal 
(Figures 2, 4, and Table 1). Specifically, 
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deubiquitinated ^gal proteins with either Met, Ser, 
Ala, Thr, Val, Cys or Gly at the amino-terminus have 
relatively long in vivo half-lives of 20 hours or 
more (figure 2, lanes d to g, and Table l) , similar 
to the half-life of a control y?gal whose gene had 
not been fused to that of ubiguitin. In striking 
contrast, the p gal proteins with either Arg, Lys, 
Phe, Leu, Asp or Trp at the amino-terminus have very 
short half-lives, between approximately 2 minutes 
for Arg- yS gal and approximately 3 minutes for Lys- 
/?gal, Phe- /3 gal, Leu- p gal, Asp-^gal, Asn-^gal 
and Trp- p gal (Figure 2, lanes n to u, and Table 1). 
The half-life of £gal proteins with amino-terminal 
residues of either Gin, His or Tyr is approximately 
10 minutes (Figure 2, lanes k to m, and Table 1), - 
while an amino-terminal He or Glu confers on a gal a 
half-life of approximately 30 minutes (Figure 2, 
lanes h to j , and Table l) . Both pulse-chase and 
continuous labeling techniques were used in these 
experiments and yielded similar results. 

The set of individual amino acids can be 
ordered with respect to the half-lives that they 
confer on gal when exposed at its amino-terminus. 
The resulting rule (Table 1) is referred to as the 
"N-end rule". 



» 
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Table 1: The N-end rule 



Residue X in 
ub-X-^gal 



Radius of 
gyration 
of X(A) 



In vivo 
deubiquitination 
of nascent 
ub-X-^gal ^i/2 ° f x "^ al 



Met 



Ala 
Thr 
Val 
Gly 
Cys 



1.80 
1.08 
0.77 
1.24 
1.29 
0 



+ 
+ 

* 

+ 
+ 
+ 
+ 
+ 



20 hours 



lie 
Glu 



1.56 
1.77 



+ 
+ 



30 minutes 



Tyr 
Gin 
His 



2.13 
1.75 



+ 
+ 



10 minutes 



Phe 
Leu 
Trp 
Asp 
Asn 
Lys 



1.90 
1.54 

1.43 

2.08 



+ 
+ 
+ 
+ 
+ 
+ 



3 minutes 
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Arg 2.38 + 2 minutes 



Pro 1.25 -* 7 minutes 



*The rate of in vivo deubiquitination of ub-Pro-/gal 
is extremely low. The t., shown is that of the 
initial ub-Fro-agal fuslfefl protein (see Fig. 4, 
lanes j to p) . 
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Leqend to Table 1 

The N-end rule. In vivo half-lives of £gal 
proteins in the yeast S. cerevisiae were determined 
either by the pulse-chase technique (for short-lived 
gal's; see below) or by measuring the enzymatic 
activity of gal in crude extracts. For the measure 
ments of #gal activity, cells growing in a galactose 
containing medium were transferred to an otherwise 
identical medium lacking galactose and containing 10 
percent glucose. After further growth for at least 
5 hours. at 30 C, the ratio of £gal activities per 
cell before and after shift to glucose was deter- 
mined for each of the ^gal proteins. [ GAL promoter- 
driven expression of the fusion genes (Figs. 1 and 
3) is repressed in glucose medium]. For shorter- 
lived fi gal proteins (t 1/? < 1 hour), the pulse- 
chase technique was useer 3s well (Figs. 2 and 4). 
EJgctrophoretic bands of p gal proteins labeled with 
[ S] methionine in pulse-chase experiments were cut 
out from scintillant-impregnated-dried gels similar 
to those of Figs. 2 and 4, and S in the bands was 
determined. The in vivo decay of short-lived gal 
proteins deviated from first-order kinetics in that 
the rate of degradation was lower when measured at 
later (1 hour) time points of the chase, the lower 
rate reflecting either a time-dependent toxic effect 
of cycloheximide or intrinsic characteristics of the 
in vivo degradation process. [Arrest of translation 
is required for an efficient short-term chase in S. 
cerevisiae because of the amino acid pool equili- 
bration problems related to the presence of vacuoles 
in this organism] . The half -life values listed 
below were determined for the first 10 minutes of 
chase. Several lines of evidence (see description 
of Figs. 4 and 6) suggest that Pro is a stabilizing 
residue. The listed radii of gyration of amino 
acids are from. M. Levitt, J. Mol . Biol . 104 ;59 

(1976). 
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Amino-terminal location of an amino acid is es- 
sential for its effect on gal half-life 

Site-directed mutagenesis was employed to 
insert a codon specifying a "stabilizing" amino acid 
(in this experiment, the Met residue) before the 
first codon of ^gal at the ubiquitin- ^ gal junction 
(Table 2). Insertion of a stabilizing residue (Met) 
before either another stabilizing residue (Thr) or a 
variety of destabilizing residues (Gin, Lys, and 
Arg) at the ubiquitin-^gal junction invariably 
results in a long-iived deubiquitinated figal (Table 
2) . Furthermore, in contrast to ubiquitin-Pro-tfgal 
which is not only short-lived but also resistant to 
deubiquitination (Figure 4, lanes j to p, and Table 
1) , ubiquitin-Met-Pro-^gal is efficiently deubi- 
quitinated in vivo to yield a long-lived Met-Pro- 
^gal (Table 2). These results show that both the 
identity of amino acid residue and its amino- 
terminal location (presumably the presence of a free 
a-amino group) are essential for its effect on *gal 
half life, in addition, these results (Table 2) 
further support the expectation that ubiquitin 
cleavage off the fusion protein occurs immediately 
after the last Gly residue of ubiquitin (Figure 3A) . 
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Table 2: N-ternrinal location of an amino acid 1s 
essential for its effect on 0gal half-life * 



Fusion protein 



t^ of deubiquitinated 
fusion protein 



r 



T 

ub - Thr - /jgal 



ub - Met - Thr - 0gal 



y 



y 

ub - Gin - £ga1 



ub - Met - Gin - igal 



r 



ub - Lys -0gal 



ub - Met - Lys - flgal 



ub i Arg - g gal 
ub - Met - Arg - flgal 



T 



>20 hours 
>20 hours 

a 10 minutes 
>20 hours 

85 3 minutes 
>20 hours 

s 2 minutes 
>20 hours 



T 



ub - Pro - flgal 



ub - Met - Pro - /jgal 



»7 minutes 
>2Q hours 



Amino-terminal location of an amino acid is essen- 
tial for its effect on 0gal half-life. The inser- 
tion mutants were obtained essentially as described 
for the initial set of mutants except that a 32- 
residue oligonucleotide , 5 '-CCCGGGATCCGTGC(G/C/ 
T/) (G/T)CATACCACCTCTTAG was used, containing 14 
bases on the 5* side and 15 bases on the 3 1 side of 
the ambiguous codon inserted behind the Met codon. 
Bases ^ in parentheses denote ambiguities at the 
positions 16 and 17 in the sequence. Half -lives of 
the corresponding ^gal proteins were determined as 
described in the legend to Table l. 
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A long-lived cleavage product of gal is formed 
during decay of short-lived ^gal proteins . 

The electrophoretic patterns of short-lived 
(but not of long-lived) ^gal proteins invariably 
contain a specific, about 90 kD cleavage product of 
ySgal (Figure 2, lanes n to u) which, unlike the 
parental /?gal species, accumulates during the 
postlabeling (chase} period (Figure 4, lanes m-p) . 
The 90 KD y9gal fragment constitutes a relatively 
small proportion of the initial amount of the 
pulse-labeled ^gal . . Nonetheless, its existence 
implies that an in vivo endoproteolytic cleavage can 
rescue a protein fragment from the metabolic fate of 
its short-lived parental protein. It remains to be 
seen whether the resulting possibility of multiple 
half~lives within a single protein species is 
exploited in the design of naturally short-lived 
proteins. 

Ubicmiti n-jggal is short-lived when not deubiqui- 
tinated. 

Ubiquitih-Pro-^gal , the only ubiquitin-^gal 
fusion that is not deubiquitinated in vivo (Fig. 4, 
lanes j to p) , has a half-life of approximately 7 
minutes (Table 1) which is less than 1 percent of 
the half -life of metabolically stable^ gal proteins 
(Table 1) . One interpretation of this result is 
that a metabolically stable ubiquitin attachment to 
protein amino- termini is sufficient to signal 
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degradation of acceptor proteins. This inter- 
pretation is consistent with earlier biochemical and 
genetic evidence that ubiquitination of short-lived 
proteins in a mammalian cell is essential for their 
degradation. At the same time, all ubiquitin-^gal 
fusion, proteins other than ubiquitin-Pro-^gal are 
rapidly deubiquitinated in vivo (Table 1) . Thus, 
the posttranslational amino-termal ubiquitination of 
proteins may not be involved in an initial recogni- 
tion or commitment step that designates proteins for 
degration in vivo. Whether posttranslational 
amino-terminal ubiquitination (if it actually occurs 
in vivo) is essential for later stages of the 
degradation pathway remains to be determined. 
Earlier in vitro eocperiments indicated that prefer- 
ential chemical modification of amino-termini of 
proteolytic substrates inhibits their degradation in 
an in vitro ubiquitin-dependent proteolytic system. 
Based on these data, it was proposed that amino-ter- 
minal ubiquitination of proteins is essential for 
their degradation. An alternative interpretation of 
the same results is that chemical blocking of 
proteins 1 amino-termini prevents the recognition of 
their amino-terminal residues by the "N-end rule" 
pathway whose initial stages are not necessarily 
ubiquitin-dependent . 
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Short-llv ed 
in vivo. 



ral proteins are multiply ubiquitinated 



Over exposures of the pulse-chase fluorograms 
(Fig. 2 J reveal that the major band of a deubi- 
quitinated, short-lived p gal protein coexists with a 
"ladder" of larger molecular mass, * gal-containing 
bands irregularly spaced at 4 to 7 kD intervals 
(Fig. 4, lanes c to g) . Mo such larger species 
appear when the fluorograms of long-lived B gal 
proteins are similarly overexposed (Fig. 4, lanes a 
and b) . Immunological analysis with both antibodies 
to ^gal and antibodies to ubiquitin demonstrates 
that the "ladder" ^gal species contain ubiquitin 
(Fig. 5) . 



A model for the s elective degradation pathway . 

With, the exception of natural or engineered 
ubiquitin fusion proteins (Fig. 1 and Table 1) ^ 
nascent proteins apparently lack ubiquitin moieties. 
The in vivo amino-terminal processing of nascent 
noneompartmentalized proteins generates their mature 
amino-termini via the action of amino-termihal 
peptidases whose substrate specificities have been 
partially characterized. (See Tsunasawa, s. et al. 
J. Biol. Chem. 260 5382 (1985) ; Boissel, J.P. et al. 
PNAS USA 82 , 8448 (1985) ) . We suggest that the 
amino-termini thus generated are recognized by an 
"W-end-reading" enzyme. One specific model is that 
a commitment to degrade a protein molecule is made 
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as a result of the recognition of its amino-terminal 
residue by a stochastically operating enzyme whose 
probability of "clamping" at the target's amino- 
terminus is determined by the N-end rule. (Table 1) . 
Once the commitment is made, it is followed by a 
highly processive ubiquitination of the target 
protein which in the case of /Jgal is conjugated to 
more than 15 ubiquitin moieties per molecule of gal 
(Fig. 4, lanes c to g, and Fig. 5). The multiply 
ubiquitinated target protein is then degraded by a 
"down stream" enzyme (1) for which the ubiquitin 
moieties of the target serve as either recognition 
signals or denaturation (unfolding) devices, or 
both. 

The ubiquitin-containing "ladder" tfgal species 
(Fig. 4, lanes c to 1, and Fig. 5) consist of 
apparently branched ubiquitin moeities joined to the 
£ -amino groups of internal lysine residues in p gal. 
Surprisingly, the "ladder" ^gal species derived from 
ubiquitin-Pro-yjgal are electrophoretically indistin- 
guishable from the analogous species of jgal whose 
amino-terminal ubiquitin is cleaved off the nascent 
fusion protein (Fig. 4, lanes j to 1, and Fig. 5). 
If the electrophoretically indistinguishable ubiqui- 
tinated p gal species are indeed structurally homo- 
logous, these results would be compatible with two 
alternative models in which, immediately after the 
first ubiquitins are branch-conjugated to /Jgal, 
either a branch-ubiquitinated ubiquitin-Pro-flgal 



J 
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undergoes amino-terminal deubiquitination or, 
alternatively, and analogous ^gal species lacking 
the amino-terminal ubiquitin moiety reacquires it. 
Experimental . resolution of this ambiguity may 
establish whether the posttranslational amino- 
terminal ubiquitinatiori of proteins (if it occurs in 
vivo) plays a role in tie selective protein turn- 
over. 

Although both prokaryotic and eukaryotic 
) proteins appear to follow the IT-end rule (see 

below) , bacteria apparently lack the ubiquitin 
system. Thus it is possible that the hypothetical 
N-end-recognizing protein is more strongly conserved 
between prokaryotes and eukaryotes than is the rest 
of the selective degradation pathway* Interest- 
ingly, the properties of a mammalian protein E3 
whose presence is required for ubiquitination of 
proteolytic, substrates by ubiqui tin-conjugating 

* 

enzymes in vitro are consistent with it being a 
component of the N-end-recognizing protein. 




intracellular proteins . 

The unblocked- amino-terminal residues in 
metabolically stable, noncompartmentalized proteins 
from both prokaryotes and eukaryotes are exclusively 
(Fig. 6A) of the stabilizing class (Met, Ser, Ala, 
Gly, Thr, Val), that is, the class that confers long 
in vivo half-lives on gal (Table 1). The one 
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short-lived intracellular protein for which the 
mature amino-terminus is known is the ell protein of 
phage lambda , the central component of a trigger 
that determines whether A grows lytically or 
lysogenizes an infected cell. (Y.S. Ho, D. Wulff, 
M. Rosenberg, in Regulation of Gene Expression , I, 
Booth and C. Higgins, Eds. (Cambridge Univ. Press, 
London, 1986), p. 79; F. Banuett, M.A. Hoyt, L. 
McFarlane, H. Echols, I. Herskowitz, J. Mol. Biol. 
187, 213 (1986); M.A. Hoyt, D.M. Knight, A. Das, 
H.I. Miller, H. Echols, Cell 31 , 565 (1982); K. 
Nasmyth, Nature (london) 320 , 670 (1983)). The 
half-life of ell in lambda-infected E. coli is less 
than 3 minutes. Strikingly, the mature amino- 
terminus or ell starts with Arg (Ho, Y.W. et al . , J. 
Biol. Chem. 257, 9128 (1982)), the most destablizing 
residue in the N-end rule (Table 1) . 

While the destabilizing amino acids can be 
either hydrophobic, uncharged hydrophilic or charg- 
ed, they share the property of having larger radii 
of gyration than any of the stabilizing amino acids 
except Met (Table 1) . 

Amino-terminal residues in compartmentalized pro- 
teins are largely of the destabilizing class . 

Figure 6 illustrates a striking difference 
between the choice of amino-terminal residues in 
long-lived, noncompartmentalized intracellular 
proteins (A) and in compartmentalized proteins, such 
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as secreted proteins (B) , many of which are also 
long-lived in their respective extracellular com- 
partments. One implication of this finding is that 
a single intracellular degradation pathway operating 
according to the N-end rule could be responsible 
both for the diversity of in vivo half-lives of 
intracellular proteins and for the selective de- 
struction of compartmentalized proteins that are 
aberrantly introduced into the intracellular space. 
Some miscompartmentalized proteins may be more 
harmful to the ceil than others. It is therefore of 
interest that secreted eukaryotic toxins contain 
strongly destabilizing residues (Arg, Lys, Leu, Phe, 
Asp) at their amino-termini more often than the 
general population of secreted proteins (Fig. 6, 
panels B. to D) . . 

The above consideration also suggest that, if 
the topological outside of a cell, such as lumens of 
the endoplasmic reticulum and Golgi> and the extra- 
cellular space, were to have degradation pathways 
analogous to the N-end rule pathway, they could be 
based on "inverted" versions of the N-end rule in 
which, the amino-terminal residues that are de- 
stabilizing inside the cell are now the stabilizing 
ones and vice versa. Thus, the methods of the ' 
present invention should also be useful for manipu- 
lating the metabolic stability and other properties 

of compartmentalized proteins, including secreted 
ones. 
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Possible role of the N-end rule pathway in the 
turnover of loner-lived proteins . 

Long-lived intracellular proteins with destabilizing 
(Table 1) penultimate residues generally retain 
their initial amino-terminal methionine residue. 
The amino-terminal residues in long-lived intracel- 
lular proteins that do undergo amino-terminal 
processing are invariably of the stabilizing class 
(Table 1) . An interesting possibility that would 
involve the N-end rule pathway in the turnover of 
long-lived proteins is that the rate-limiting step 
in the in vivo degradation of long-lived proteins 
may be a slow aminopeptidase cleavage that exposes a 
destabilizing residue, followed by rapid degradation 
via the N-end rule pathway. Note that fine-tuning 
of the rate of degradation may in this case be a 
function of the rate of aminopeptidase cleavage 
exposing a destabilizing residue rather than a 
function of the residue's destabilizing capacity 
according to the N-end rule. 

The N-en d rule and selective degradation of short- 
lived and damaged proteins . 

The recognition of polypeptide chain folding 
patterns or of .local chemical features that target 
an otherwise long-lived but damaged protein for 
selective degradation in vivo is unlikely to be 
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mediated directly by the N-end rule pathway. 
Instead, we suggest that specific proteases (ana- 
logous in function to nucleases that recognize 
specific lesions in DNA) cleave a targeted protein 
so as to Expose a destabilizing residue at the 
amino-terminus of one of the two products of a cut. 
One testable prediction of this model is that the 
initial cleavage products of the degradation pathway 
should bear destabilizing residues at their N-ter- 
mini. The preferential exposure of destabilizing 
residues at the antino-termini of products of the 
initial protein cleavages may be due either to 
intrinsic specificities of the proteases involved or 
simply to the fact that a majority of the amino 
acids belong to the destabilizing class (Table 1). 
Furthermore r initial cleavages of a protein would be 
expected to destabilize aspects of its original 
conformation , thus increasing the probability of 
further internal cuts. Whether the initial cleavage 
products of a protein would be degraded exclusively 
via the N-end rule pathway or would have to be 
processed further by additional internal cleavages 
should depend on several factors r such as the 
exposure of -destabilizing residues at the aminb- 
termini of initial cleavage products , and the 
relative rates of introduction of internal cuts. In 
this model, the tf-end rule pathway should be es- 
sential for degradation of most of the metabolically 
unstable proteins, from chemically damaged, 
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prematurely terminated, improperly folded and 
miscompartmentalized ones to those that cannot 
assemble into native multisubunit aggregates, and 
finally to otherwise normal proteins that are 
short-lived in vivo. Thus, the metabolic in- 
stability of a protein may be mediated not only by 
the exposure of a destabilizing residue at its 
• amino-terminus , but also by local conformational and 
chemical features of its polypeptide chain that 
result in proteolytic cleavages exposing destabiliz- 
ing residues at the amino-termini of cleavage 
products . 

* # 

For any given protein, a variety of factors in ' 
addition to the N-end rule may combine to modulate 
its half-life in vivo. Among such factors may be 
the flexibility and accessibility of the protein's 
amino-terminus (Thornton, J.M. and Sibanda, B.L., J. 
Mol. Bio . 167 443 (1983)), the presence of chemical- 
ly blocking amino-terminal groups such as the acetyl 
group, the distribution of ubiquitinatable lysine 
residues near the amino-terminus, and other variabl- 
es, such as the structure of the carboxy-terminus . 
Since amino-terminal regions of multisubunit pro- 
teins are commonly involved in the interfaces 
between subunits (Thornton, J.M. and Sibanda, B.L. , 
J. Mol . Bio . 167 443 (1983)), guarternary structure 
of proteins is yet another parameter that is ex- 
pected to modulate the impact of the N-end rule 
pathway on protein half-lives in vivo. Finally as 
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suggested above, the N-erid rule pathway may also be 
essential for the degradation of proteins whose 
initial recognition as targets for degradation is 
independent of the structures at their amino- 
termini.. 



Functional signifi cance of posttranslational addi- 
tion of amino acids to am i no-termini of proteins . 
It has been known for many years that in both 
bacteria and eukaryotes there exists an unusual 
class of enzymes, aminoacyl-transfer RNA-protein 
transferases, which catalyze posttranslational 
conjugation of specific amino acids to the mature 
amino-termini of acceptor proteins in vitro (R.r,. 
Soffer, in Transfer R NA; Biological Aspects . D. Soil, 
J.N. Abelson, p.r. schimmel, Eds* (Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY 1980) , 
p493;C. Deutch, Methods Enzy mol . iQ6 , 198(1984): A. 
Kaji, H. Kaji, G.D. Nbvelli, J. Biol. Chem . 240 , 
1185 (1965)) . The posttranslational addition of 
amino acids to proteins in vivo dramatically ac- 
celerates in a stressed or regenerating tissue, for 
example, after physical injury to axons of nerve 
cells (s. Shyne-Athwal, . R.v. Riccio, G. Chakraborty, 
N.A. Ingolia, Science 231, 603 (1986) ; N.A. Ingolia 
et al. , J. Neurosci 3, 2463 (1983) ) . The N-end rule 
provides an explanation for this phenomenon. We 
suggest that selective changes in metabolic sta- 
bility of otherwise undamaged, longlived proteins 
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that may be required by a changed physiological 
state of the cell are brought about by post- 
translational addition of destabilizing amino acids 
to the amino-termini of target proteins in vivo. 
Strikingly, the known reactions of posttranslational 
addition of amino acids to proteins (R.L. Soffer, in 
Transfer RNA: Biological Aspects , D. Soil, J.N. 
Abelson, P.R. Schimmel, Eds. (Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY 1980), p493;C. 
Deutch, Methods Enzymol . 106 , 198(1984): A. Kaji, H. 
Kaji, G.D. Novelli, J. Biol. Chem . 240 , 1185 (1965); 
S. Shyne-Athwal , R.V. Riccio, G. Chakraborty, N. A. 
Ingolia, Science 231 , 603 (1986); N.A. Ingolia et 
al., J. Neurosci 3, 2463 (1983)) involve largely 
those amino acids (Arg, Lys, Leu, Phe, and Tyr) that 
are destabilizing according to the N-end rule (Table 
1) • Physiological states in which addition of 
destabilizing amino acids to proteins could be 
expected to occur include entry to and exit from the 
cell cycle, responses to chemical or physical 
stress, and specific differentiation events, such as 
erythroid differentiation and spermatogenesis, in 
which a proportion of preexisting, otherwise long- 
lived intracellular proteins is selectively de- 
graded*. 

The in vitro degradation of some proteolytic 
substrates in a ubiquitin-dependent system from 
* mammalian reticulocytes has recently been shown to 

depend on the presence of certain aminoacyl-tRNAs 
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(Ferber, S. and Crechanover, A. r J. Biol . Chem . 261 
3128 C1986) ) . We suggest that this phenomenon also 
reflects a requirement for posttranslational ad- 
dition of specific destabilizing amino acids to the 
amino-termini of proteolytic substrates. The 
initial proteolytic substrates in question have 
amino-terminal residues of Asp or Glu, both of which 
are destabilizing according to the N-end rule 
(Table 1) • This raises an interesting and testable 
possibility that certain amino-terminal residues in 
. proteins may not be directly destabilizing as such 
but only through their ability to be conjugated to 
other destabilizing residues. 

Equivalents 

Those skilled in the art vd.ll recognize, or be 
able to ascertain using no more than routine experi- 
mentation, many equivalents to the specific embodi- 
ments of the invention described herein. Such 
equivalents are intended to be encompassed by the 
following claims. 
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CLAIMS 



1. A method of regulating the stability of an 

intracellular protein, comprising modifying the 
amino-terminus of a protein to provide for 
exposure at the amino-terminus of a desired 
amino acid of the stabilizing or destabilizing 
class according to the N-end rule of protein 
degradation. 

2. A method of destabilizing an intracellular 
protein, comprising modifying the protein to 

* * 

provide an amino-terminal amino acid of the de- 
stabilizing class. 

3. A method of Claim. 2, wherein an amino acid of 
the destabilizing class is selected from the 
group consisting of Isoleucine, Glutamic acid, 
Tyrosine, Glutamine, Phenylalanine, Leucine, 
Asparginine, Lysine or Arginine. 

4. A method of producing a protein which is more 
or less resistant to degradation than its 
wild- type counterpart, comprising introducing 
into the amino-terminus of the wild-type 
protein a desired amino acid of the stabilizing 
or destabilizing class according to the N-end 
rule of protein degradation to produce a 
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modified protein having a stabilizing or 
destabilizing amino acid at its amino-terminus . 

5. A method of Claim 4, wherein the modified 
protein is produced as a fusion protein con- 
taining a masking protein joined to its amino- 
terminus, the masking protein being cleavable 
at the junction with the aminq-terminus of the 
modified protein. 

6. A method of Claim 5, wherein the masking 
protein is ubiquitin. 

7. A method of Claim 6, wherein ubiquitin-specif ic 
protease is u£ed to cleave ubiquitin from the 

. fusion protein at the ubiquitin-protein junc- 
tion either in vivo or in vitro . 

8. a method of Claim 4, wherein the masking 
protein is joined to the amino-terminus of the 
modified protein through a protein sequence 
comprising a site for recognition by an endo- 
protease. 

9. A method of Claim 8, wherein the endoprotease 

is complement factor X . 

a 

10. A method of producing a metabolically unstable 
protein comprising producing the protein as a 
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fusion protein wherein ubiquitin is joined to 
the amino-terminus of the protein in a manner 
such that the fusion protein cannot be ef- 
ficiently deubiquitinated. 

11. - A method, of Claim 10, wherein the fusion 

protein" comprises ubiquitin-proline-protein. 

12. A method of Claim 10, wherein the fusion 
protein contains ubiquitin having a modifica- 
tion in the amino acid sequence recognized by 
ubiquitin-specific processing protease so as to 
reduce the efficiency of deubiquitination of 
the fusion protein by the protease. 

13. A gene construct encoding a fusion protein, the 
gene construct comprising a DNA sequence 
encoding a masking protein joined to a DNA 
sequence encoding a protein of interest having 
a predetermined amino acid of the stabilizing 
or destabilizing class at its amino-terminus, 
the masking protein being cleavable at the 
junction with the amino-terminus of the protein, 
of interest such that cleavage results in 
exposure of the amino-terminus of the protein 
of interest. 

14. A method of Claim 13, wherein the masking 
protein is ubiquitin. 
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A method of producing a protein having a 
predetermined amino- terminal structure , com- 
prising expressing the protein as a fusion 
protein wherein the amino- terminus of the 
protein is fused to a masking protein which is 
specif ically cleavable in vivo or in vitro at 
the junction with the aaino-terminal amino acid 
of the protein. 

] 16. A method of Claim 15 , wherein the masking 

Protein is cleavable by a endoprotease. 

17. A method of Claim 16 ^ wherein the masking 
protein is ubiquitin and the cleaving endo- 
protease is ubiquitin-specif ic protease. 

j 

18. A method of producing a protein having a 

predetermined amino- terminal structure , com- 
prising: 

a. preparing a DNA construct encoding a 
fusion protein comprising; 

i) a structural gene encoding the 
protein , the gene having a sequence 
at its 5 1 end encoding the pre- 
determined amino-terminal structure 
of the protein; and 

ii) DNA encoding a masking protein linked 
to the 5 1 end of the structural gene 
encoding the protein, the masking 



15. 

i 
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protein being specifically cleavable 
in vivo or in vitro at its junction 
with the amino-terminus of the 
protein; and 
b. expressing the DNA construct in a host 

cell to produce the fusion protein encoded 

by the DNA construct. 



19. A method of Claim 18, wherein the fusion 
protein is cleaved within the host cell to 
release the protein having the predetermined 
amino-terminal structure. 

20. A method of Claim 18, wherein the fusion 
protein is produced in a cell which lacks the 
capability to cleave the masking protein and 
the fusion protein is cleaved in vitro to 
release the protein having the predetermined 
amino- terminal structure . 

21. A method of Claim 18, wherein the masking 
protein is ubiquitin. 

22. A DNA construct for producing a protein having 
a predetermined amino-terminal structure, 
comprising: 

a. a structural gene encoding the protein, 
the gene having a sequence at its 5' end 
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encoding the predetermined amino-terminal * 
structure of the protein; and 
b. DNA encoding a masking protein linked to 

tlie 5 1 end of the structural gene encoding 
the protein, the masking protein being 
specifically cleavable in vivo or in vitro 
at its junction with the amino-terminus of 
the protein. 

23. A method of Claim 22, wherein the masking 
protein is ubiquitin. 

24. Isolated ubiquitin-specif ic protease capable of 
cleaving ubiquitin from the amino-terminus of a- 
proteitf to which it is joined at the junction 
between ubiquitin and the amino-terminus of the 
protein. 

25. An isolated DNA sequence encoding ubiquitin- 
specif ic protease of Claim 24. 

26. A mutant cell line which is conditionally or 
unconditionally incapable of synthesizing one 
or more proteases involved in the N-end rule 
degradative pathway. 
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